BreakingNews: Article Annotation by Image and Text Processing

Abstract

Current approaches at the intersection of computer vision and NLP have achieved unprecedented breakthroughs in tasks such as automatic captioning and image retrieval. Most of these methods, though, rely on training sets of images associated with annotations that specifically describe the visual content. This paper goes a step further and explores more complex cases where textual descriptions are only loosely related to images. We focus on the particular domain of news. We introduce new deep learning methods that address source and popularity prediction, article illustration, and article geolocation. We propose an adaptive CNN that shares most of its structure across all tasks and is suitable for multitask and transfer learning. Deep CCA is deployed for article illustration, and a new loss function based on Great Circle Distance is proposed for geolocation. Furthermore, we present BreakingNews, a novel dataset of approximately 100K news articles that include images, text, and captions and are enriched with heterogeneous metadata. BreakingNews allows exploring all the aforementioned problems, for which we provide baseline performances using various CNN architectures and different representations of the textual and visual features. We report promising results and bring to light several limitations of the current state of the art, which we hope will help spur progress in the field.
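
The abstract mentions a geolocation loss based on Great Circle Distance but does not give its formulation. As a rough illustration only, the sketch below computes the great-circle distance between predicted and true coordinates with the standard haversine formula and averages it over a batch. The function names, the choice of the haversine form, and the mean Earth radius of 6371 km are assumptions made for this example, not details taken from the paper.

```python
import math

# Assumed constant: mean Earth radius in kilometres (not specified in the abstract).
EARTH_RADIUS_KM = 6371.0


def great_circle_distance_km(lat1, lon1, lat2, lon2):
    """Haversine great-circle distance between two points given in degrees."""
    phi1 = math.radians(lat1)
    phi2 = math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    # Clamp to 1.0 to guard against floating-point rounding above the asin domain.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))


def mean_gcd_loss(predicted, target):
    """Hypothetical loss: mean great-circle distance (km) over paired (lat, lon) tuples."""
    if len(predicted) != len(target) or not predicted:
        raise ValueError("predicted and target must be non-empty and of equal length")
    total = sum(
        great_circle_distance_km(p[0], p[1], t[0], t[1])
        for p, t in zip(predicted, target)
    )
    return total / len(predicted)
```

A call such as `mean_gcd_loss([(41.39, 2.17)], [(51.51, -0.13)])` returns the average error in kilometres between predicted and reference locations. Unlike a squared error on raw latitude and longitude, this measure respects the curvature of the Earth and the wrap-around of longitude at ±180°.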

Bibliographic details

Authors
Ramisa, A; Yan, F; Moreno-Noguer, F; Mikolajczyk, K
Year: 2017
Original format: PDF

Similar literature

1. tagtog: interactive and text-mining-assisted annotation of gene mentions in PLOS full-text articles [J]. Burkhard Rost, Gillian H. Millburn, Juan Miguel Cejuela, Database. 2014, issue 0

2. A 0.27e$^{-}_{\text{rms}}$ Read Noise 220-$\mu\text{V}/\text{e}^{-}$ Conversion Gain Reset-Gate-Less CMOS Image Sensor With 0.11-$\mu\text{m}$ CIS Process [J]. Seo Min-Woong, Kawahito Shoji, Kagawa Keiichiro, Electron Device Letters, IEEE. 2015, issue 12

3. The articles.ELM resource: simplifying access to protein linear motif literature by annotation, text-mining and classification [J]. N Palopoli, J A Iserte, L B Chemes, Database. 2020, issue 1

4. Annotating article errors in Spanish learner texts: design and evaluation of an annotation scheme [C]. Maria del Pilar Valverde Ibanez, Akira Ohtani, Pacific Asia Conference on Language, Information and Computation. 2015

5. The effect of different types of image annotations in a scientific text on different learning outcomes in multimedia learning environments [D]. Hamilton, Heather Suzanne. 2003

6. tagtog: interactive and text-mining-assisted annotation of gene mentions in PLOS full-text articles [O]. Juan Miguel Cejuela, Peter McQuilton, Laura Ponting. 2014

7. BreakingNews: Article Annotation by Image and Text Processing [O]. Ramisa, Arnau; Yan, Fei; Moreno-Noguer, Francesc. 2016

8. Complex Event Processing for Content-Based Text, Image, and Video Retrieval [R]. Boury-Brisset, A.; Bowman, E. K.; Burghouts, G. 2016
